SEQUENCE LISTING 


(1) GENERAL INFORMATION: 

(i) APPLICANT: Lin, Lin-Ling 

Graham , Jame s 

(ii) TITLE OF INVENTION: NOVEL INTERLEUKIN- 1 RECEPTOR 

INTRACELLULAR LIGAND PROTEINS AND INHIBITORS OF LIGAND 
BINDING 

(iii) NUMBER OF SEQUENCES: 7 

(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: LEGAL AFFAIRS, GENETICS INSTITUTE, INC. 

(B) STREET: 87 CambridgePark Drive 
<C) • CITY.- Cambridge 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02X40 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version SI. 25 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
'(B) FILING DATE: 
(C) CLASSIFICATION: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME : Brown, Scott A. 

(B) REGISTRATION NUMBER : 32,72 4 

(C) REFERENCE /DOCKET NUMBER: GI52 58 

(ix) TELECOMMUNICATION INFORMATION: 
•-'(A) TELEPHONE: (617) 498-8224 
(B) TELEFAX : (617) 876-5851 


(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1S71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 


( ix) FEATURE : 

(A) NAME / KEY : CDS 

(B) LOCATION : 2 . .52 9 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

G ATC CCC AGG GTG GAC CTC CGG GTG TGG CAG GAC TGC TGT GAA GAC 

lie Pro Arg Val Asp Leu Arg Val Trp Gin Asp Cys Cys Glu Asp 

15 10 15 


28 


TGT AGG ACC AGG GGG CAG TTG AAT GCC TTT TCC TAT CAT TTC CGA GGC 94 
Cys Arg Thr Arg Gly Gin Phe Asn Ala Phe Ser Tyr His Phe Arg Gly 
20 25 30 

AGA CGG TCT CTT GAG TTC AGC TAC CAG GAG GAC AAG CCG ACC AAG AAA 142 
Arg Arg Ser Leu Glu Phe Ser Tyr Gin Glu Asp Lys Pro Thr Lys Lys 
35 40 45 

ACA AGA CCA CGG AAA ATA CCC AGT GTT GGG AGA CAG GGG GAA CAT CTC 190 
Thr Arg Pro Arg Lys lie Pro Ser Val Gly Arg Gin Gly Glu His Leu 
50 55 60 

AGC AAC AGC ACC TCA GCC TTC AGC ACA CGC TCA GAT GCA TCT GGG ACA 23 8 

Ser Asn Ser Thr Ser Ala Phe Ser Thr Arg Ser Asp Ala Ser Gly Thr 
65 70 75 

AAT GAC TTC AGA GAG TTT GTT CTG GAA ATG CAG AAG ACC ATC ACA GAC 286 
Asn Asp Phe Arg Glu Phe Val Leu Glu Met Gin Lys Thr He Thr Asp 
80 85 90 95 

CTC AGA ACA CAG ATA AAG AAA CTT GAA TCA CGG CTC AGT ACC ACA GAG 334 
Leu Arg Thr Gin He Lys Lys Leu Glu Ser Arg Leu Ser Thr Thr Glu 
100 105 110 

TGC GTG GAT GCC GGG GGC GAA TCT CAC GCC AAC AAC ACC AAG TGG AAA 3 82 

Cys Val Asp Ala Gly Gly Glu Ser His Ala Asn Asn Thr Lys Trp Lys 
115 120 125 

AAA GAT GCA TGC ACC ATT TGT GAA TGC AAA GAC GGG CAG GTC ACC TGC 43 0 

Lys Asp Ala Cys Thr He Cys Glu Cys Lys Asp Gly Gin Val Thr Cys 
130 13 5 14 0 

TTC GTG GAA GCT TGC CCC CCT GCC ACC TGT GCT GTC CCC GTG AAC ATC 47 8 

Phe Val Glu Ala Cys Pro Pro Ala Thr Cys Ala Val Pro Val Asn He 
145 150 155 

CCA GGG GCC TGC TGT CCA GTC TGC TTA CAG AAG AGG GCG GAG GAA AAG 526 
Pro Gly Ala Cys Cys Pro Val Cys Leu Gin Lys Arg Ala Glu Glu Lys 
160 165 170 175 

CCC TAGGCTCCTG GGAGGCTCCT CAGAGTTTGT CTGCTGTGCC ATCGTGAGAT 5 79 
Pro 

CGGGTGGCCG ATGGCAGGGA GCTGCGGACT GCAGACCAGG AAACACCCAG AACTCGTGAC 63 9 

ATTTCATGAC AACGTCCAGC TGGTGCTGTT ACAGAAGGCA GTGCAGGAGG CTTCCAACCA 699 

GAGCATCTGC GGAGAAGGAG GCACAGCAGG TGCCTGAAGG GAAGCAGGCA GGAGTCCTAG 759 

CTTCACGTTA GACTTCTCAG GTTTTTATTT AATTCTTTTA AAATGAAAAA TTGGTGCTAC 819 

TATTAAATTG CACAGTTGAA TCATTTAGGC GCCTAAATTG ATTTTGCCTC CCAACACCAT 8 79 

TTCTTTTTAA ATAAAGCAGG ATACCTCTAT ATGTCAGCCT TGCCTTGTTC AGATGCCAGG 939 

AGCCGGCAGA CCTGTCACCC GCAGGTGGGG TGAGTCTCGG AGCTGCCAGA GGGGCTCACC 999 

GAAATCGGGG TTCCATCACA AGCTATGTTT AAAAAGAAAA TTGGTGTTTG CCAAACGGAA 1059 

CAGAACCTTT GATGAGAGCG TTCACAGGGA CACTGTCTGG GGGTGCAGTG CAAGCCCCCG 1119 

GCCTCTTCCC TGGGAACCTC TGAACTCCTC CTTCCTCTGG GCTCTCTGTA ACATTTCACC 117 9 

ACACGTCAGC ATCTAATCCC AAGACAAACA TTCCCGCTGC TCGAAGCAGC TGT AT AGC CT 123 9 

GTGACTCTCC GTGTGTCAGC TCCTTCCACA CCTGATTAG;* ACATTCATAA GCCACATTTA 12 9 9 


GAAACAGGTT TGCTTTCAGC TGTCACTTGC ACACATACTG CCTAGTTGTG AACCAAATGT 13 59 

GAAAAAACCT CCTTCATCCC ATTGTGTATC TGATACCTGC CGAGGGCCAA GGGTGTGTGT 1419 

TGACAACGCC GCTCCCAGCC GGCCCTGGTT GCGTCCACGT CCTGAACAAG AGCCGCTTCC 14 79 

GGATGGCTCT TCCCAAGGGA GGAGGAGCTC AAGTGTCGGG AACTGTCTAA CTTCAGGTTG 153 9 

TGTGAGTGCG TTAAAAAAAA AAAAAAAAAA AA 1571 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

lie Pro Arg Val Asp Leu Arg Val Trp Gin Asp Cys Cys Glu Asp Cys 
15 10 15 

Arg Thr Arg Gly Gin Phe Asn Ala Phe Ser Tyr His Phe Arg Gly Arg 
20 25 30 

Arg Ser Leu Glu Phe Ser Tyr Gin Glu Asp Lys Pro Thr Lys Lys Thr 
35 40 45 

Arg Pro Arg Lys lie Pro Ser Val Gly Arg Gin Gly Glu His Leu Ser 
50 55 60 

Asn Ser Thr Ser Ala Phe Ser Thr Arg Ser Asp Ala Ser Gly Thr Asn 
65 70 75 80 

Asp Phe Arg Glu Phe Val Leu Glu Met Gin Lys Thr He Thr Asp Leu 
85 90 95 

Arg Thr' Gin He Lys Lys Leu Glu Ser Arg Leu Ser Thr Thr Glu Cys 
100 105 110 

Val Asp Ala Gly Gly Glu Ser His Ala Asn Asn Thr Lys Trp Lys Lvs 
115 120 125 

Asp Ala Cys Thr He Cys Glu Cys Lys Asp Gly Gin Val Thr Cys Phe 
130 135 140 

Val Glu Ala Cys Pro Pro Ala Thr Cys Ala Val Pro Val Asn He Pro 
145 150 155 160 

Gly Ala Cys Cys Pro Val Cys Leu Gin Lys Arg Ala Glu Glu Lys Pro 
165 170 175 


(2} INFORMATION FOR SEQ ID NO : 3 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1088 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

v 

30 


( ix) FEATURE : 

(A) NAME/KEY: CDS 
<B) LOCATION: 2 . . 961 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

G AAA AAA GGA GGT AAA ACA GAA CAG GAT GGC TAT CAG AAA CCC ACC 46 
Lys Lys Gly Gly Lys Thr Glu Gin Asp Gly Tyr Gin Lys Pro Thr 
15 10 IS 

AAC AAA CAC TTC ACG CAG AGT CCC AAG AAG TCA GTG GCC GAC CTG CTG 94 
Asn Lys His Phe Thr Gin Ser Pro Lys Lys Ser Val Ala Asp Leu Leu 
20 25 30 

GGG TCC TTT GAA GGC AAA CGA AGA CTC CTT CTG ATC ACT GCT CCC AAG 14 2 

Gly Ser Phe .Glu Gly Lys Arg Arg Leu Leu Leu lie Thr Ala Pro Lys 
35 40 45 

GCT GAG AAC AAT ATG TAT GTG CAA CAA CGT GAT GAA TAT CTG GAA AGT 190 
Ala Glu Asn Asn Met Tyr Val Gin Gin Arg Asp Glu Tyr Leu Glu Ser 
50 55 60 

TTC TGC AAG ATG GCT ACC AGG AAA ATC TCT GTG ATC ACC ATC TTC GGC 23 8 

Phe Cys Lys Met Ala Thr Arg Lys lie Ser Val He Thr He Phe Gly 
65 70 75 

CCT GTC AAC AAC AGC ACC ATG AAA ATC GAC CAC TTT CAG CTA GAT AAT 286 
Pro Val Asn Asn Ser Thr Met Lys He Asp His Phe Gin Leu Asp Asn 
80 ' 85 90 95 

GAG AAG CCC ATG CGA GTG GTG GAT GAT GAA GAC TTG GTA GAC CAG CGT 3 34 

Glu Lys Pro Met Arg Val Val Asp Asp Glu Asp Leu Val Asp Gin Arg 
100 " 105 110 

CTC ATC AGC GAG CTG AGG AAA GAG TAC GGA ATG ACC TAC AAT GAC TTC 3 32 

Leu He Ser Glu Leu Arg Lys Glu Tyr Gly Met Thr Tyr Asn Asp Phe 
115 12 0 12 5 

TTC ATG GTG CTA ACA GAT GTG GAT CTG AGA GTC AAG CAA TAC TAT GAG 43 0 

Phe Met Val Leu Thr Asp Val Asp Leu Arg Val Lys Gin Tvr Tvr Glu 
130 135 ' ' 140 

GTA CCA ATA ACA ATG AAG TCT GTG TTT GAT CTG ATC GAT ACT TTC CAG 4 76 

Val Pro He Thr Met Lys Ser Val Phe Asp Leu He Asp Thr Phe Gin 
145 150 155 

TCC CGA ATC AAA GAT ATG GAG AAG CAG AAG AAG GAG GGC ATT GTT TGC 526 
Ser Arg He Lys Asp Met Glu Lys Gin Lys Lys Glu Gly lie Val Cys 
160 165 170 175 

AAA GAG GAA GTT GGG GGA GTG TTA GAA CTG TTC CCA ATT AAT GGG AGC 5 74 

Lys Glu Glu Val Gly Gly Val Leu Glu Leu Phe Pro He Asn Gly Ser 
180 185 190 

TCT GTT GTT GAG CGA GAA GAC GTA CCA GCC CAT TTG GTG AAA GAC ATT 622 
Ser Val Val Glu Arg Glu Asp Val Pre Ala His Leu Val Lys Asp lie 
195 200 205 

CGT AAC TAT TTT CAA GTG AGC CCG GAG TAC TTC TCC ATG CTT CTA GTC 670 
Arg Asn Tyr Phe Gin Val Ser Pro Glu Tyr Phe Ser Met Leu Leu Val 
210 215 220 

GGA AAA GAC GGA AAT GTC AAA TCC TC-G TAT CCT TCC CCA ATG TGG TCC 713 
Gly Lys Asp Gly Asn Val Lys Ser Trp Tyr Pro Ser Pro Met Trp Ser 
225 230 * 235 


ATG GTG ATT GTG TAC GAT TTA ATT GAT TCG ATG CAA CTT CGG AGA CAG 7 66 

Met Val He Val Tyr Asp Leu lie Asp Ser Met Gin Leu Arg Arg Gin 
240 245 2S0 255 

GAA ATG GCG ATT CAG CAG TCA CTG GGG ATG CGC TGC CAG AAG ATG AGT 814 
Glu Met Ala lie Gin Gin Ser Leu Gly Met Arg Cys Gin Lys Met Ser 
260 265 270 

ATG CAG GCT ATG GTT ACC ATA GTT ACC ACC AAG GAT ACC AGG ATG GTT 862 
Met Gin Ala Met Val Thr He Val Thr Thr Lys Asp Thr Arg Met Val 
275 280 285 

ACC AGG ATG ACT ACC GTC ATC ATG AGA GTT ATC ACC ATG GAT ACC CTT 910 
Thr Arg Met Thr Thr Val He Met Arg Val He Thr Met Asp Thr Leu 
290 295 300 

ACT GAG CAG AAA TAT GTA ACC TTA GAC TCA GCC AGT TTC CTC TGC AGC 958 
Thr Glu Gin Lys Tyr Val Thr Leu Asp Ser Ala Ser Phe Leu Cys Ser 
305 ' 310 315 

TGC TAAAACTACA TGTGGCCAGC TCCATTCTTC CACACTGCGT ACTACATTTC 1011 

Cys 

320 

CTGCCTTTTT CTTTCAGTGT TTTTCTAAGA CTAAATAAAT AGCAAACTTT C AC CTAAAAA 1071 
AAAAAAAAAA AAAAAAA 1088 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 320 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 4 : 

Lys Lys Gly Gly Lys Thr Glu Gin Asp Gly Tyr Gin Lys Pre Thr Asn 

15 10 is 

Lys His Phe Thr Gin Ser Pro Lys Lys Ser Val Ala Asp Leu Leu Gly 
20 25 30 

Ser Phe Glu Gly Lys Arg Arg Leu Leu Leu He Thr Ala Pro Lys Ala 
35 40 45 

Glu Asn Asn Met Tyr Val Gin Gin Arg Asp Glu Tyr Leu Glu Ser Phe 
50 55 60 

Cys Lys Met Ala Thr Arg Lys He Ser Val He Thr He Phe Gly Pro 
65 70 75 80 

Val Asn Asn Ser Thr Met Lys He Asp His Phe Gin Leu Asp Asn Glu 
85 90 95 

Lys Pro Met Arg Val Val Asp Asp Glu Asp Leu Val Asp Gin Arg Leu 
100 105 110 

He Ser Glu Leu Arg Lys Glu Tyr Gly Met Thr Tyr Asn Asp Phe Phe 
115 120 125 

Met Val Leu Thr Asp Val Asp Leu Arg Val Lys Gin Tyr Tyr Glu Val 
130 135 140 
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Pro lie Thr Met 
145 

Arg lie Lys Asp 


Glu Glu Val Gly 
180 

Val Val Glu Arg 
195 

Asn Tyr Phe Gin 
210 


Lys Ser Val Phe 
150 

Met Glu Lys Gin 
165 

Gly Val Leu Glu 


Glu Asp Val Pro 

200 

Val Ser Pro Glu 
215 


Asp Leu lie Asp 
155 

Lys Lys Glu Gly 
170 

Leu Phe Pro lie 
185 

Ala His Leu Val 


Tyr Phe Ser Met 
220 


Thr Phe Gin Ser 
160 

lie Val Cys Lys 
175 

Asn Gly Ser Ser 
190 

Lys Asp lie Arg 
205 

Leu Leu Val Gly 


Lys Asp Gly Asn 
225 

Val lie Val - Tyr 


Met Ala lie Gin 
260 

Gin Ala Met Val 
275 

Arg Met Thr Thr 
290 

Glu Gin Lys Tyr 
305 


Val Lys Ser Trp 
230 

Asp Leu lie Asp 
245 

Gin Ser Leu Gly 


Thr lie Val Thr 
280 

Val lie Met Arg 
295 

Val Thr Leu Asp 
310 


Tyr Pro Ser Pro 
235 

Ser Met Gin Leu 
250 

Met Arg Cys Gin 
265 

Thr Lys Asp Thr 


Val lie Thr Met 
300 

Ser Ala Ser Phe 
315 


Met Trp Ser Met 
240 

Arg Arg Gin Glu 
255 

Lys Met Ser Met 
270 

Arg Met Val Thr 
28S 

Asp Thr Leu Thr 


Leu Cys Ser Cvs 
32 0 


(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 59 base pairs 

(B) TYPE: nucleic acid 

_ (C) 3TRANDEDNESS : double 
(D) TOPOLOGY: linear 

(ll) MOLECULE TYPE: cDNA 

(ili) HYPOTHETICAL: NO 


(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2 . . 754 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

C AAA AAC TTC TTC CTG ACG AAT CGC GCC AGG GAG CGC TCA GAC ACC 4 6 

Lys Asn Phe Phe Leu Thr Asn Arg Ala Arg Glu Arg Ser Asp Thr 
IS 10 15 

TTC ATC AAC CTC CGG GAG GTG CTC AAC CGC TTC AAG CTG CCG ZZ~. C-C-A 94 
Phe He Asn Leu Arg Glu Val Leu Asn Arg Phe Lys Leu Pro Pro Gly 
20 2 5 3 0 


GAG TAC ATT CTC GTG CCT TCC ACC TTC GAA CCC AAC AAG GAT C-3G GAT 142 
Glu Tyr He Leu Val Pro Ser Thr Phe Glu Pro Asn Lys Asp Gly Asp 
35 40 45 
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TTC TGC ATC CGG GTC TTT TCT GAA AAG AAA GCT GAC TAC CAA GCT GTC ISO 
Phe Cys lie Arg Val Phe Ser Glu Lys Lys Ala Asp Tyr Gin Ala Val 
50 55 60 

GAT GAT GAA ATC GAG GCC AAT CTT GAA GAG TTC GAC ATC AGC GAG GAT 23 8 

Asp Asp Glu lie Glu Ala Asn Leu Glu Glu Phe Asp lie Ser Glu Asp 
65 70 75 

GAC ATT GAT GAT GGA TTC AGG AGA CTG TTT GCC CAG TTG GCA GGA GAG 286 
Asp lie Asp Asp Gly Phe Arg Arg Leu Phe Ala Gin Leu Ala Gly Glu 
80 85 90 95 

GAT GCG GAG ATC TCT GCC TTT GAG CTG CAG ACC ATC CTG AGA AGG GTT 3 34 

Asp Ala Glu lie Ser Ala Phe Glu Leu Gin Thr lie Leu Arg Arg Val 
100 105 110 

CTA GCA AAG CGC CAA GAT ATC AAG TCA GAT GGC TTC AGC ATC GAG ACA 382 
Leu Ala Lys Arg Gin Asp lie Lys Ser Asp Gly Phe Ser lie Glu Thr 
115 120 125 

TGC AAA ATT ATG GTT GAC ATG CTA GAT TCG GAC GGG AGT GGC AAG CTG 4 3 0 

Cys Lys lie Met Val Asp Met Leu Asp Ser Asp Gly Ser Gly Lys Leu 
130 135 140 

GGG CTG AAG GAG TTC TAC ATT CTC TGG ACG AAG ATT CAA AAA TAC CAA 47 8 

Gly Leu Lys Glu Phe Tyr lie Leu Trp Thr Lys He Gin Lys Tvr Gin 
145 150 155 

AAA ATT TAC CGA GAA ATC GAC GTT GAC AGG TCT GGT ACC ATG AAT TCC 525 
Lys He Tyr Arg Glu He Asp Val Asp Arg Ser Gly Thr Met Asn Ser 
160 165 170 175 

TAT GAA ATG CGG AAG GCA TTA GAA GAA GCA GGT TTC AAG ATG CCC TGT S~~ 
Tyr Glu Met Arg Lys Ala Leu Glu Glu Ala Gly Phe Lys Met Pro Cys 
180 185 190 

CAA CTC CAC CAA GTC ATC GTT GCT CGG TTT GCA GAT GAC CAG CTC ATC ;22 

Gin Leu His Gin Val He Val Ala Arg Phe Ala Asp Asp Gin Leu He 
195 200 205 

•J-4 4 

ATC GAT 'TTT GAT AAT TTT GTT CGG TGT TTG GTT CGG CTG GAA ACG CTA " C 

He Asp Phe Asp Asn Phe Val Arg Cys Leu Val Arg Leu Glu Tnr Leu 
210 215 220 

TTC AAG ATA TTT AAG CAG CTG GAT CCC GAG AAT ACT GGA ACA ATA GAG "IS 
Phe Lys He Phe Lys Gin Leu Asp Pro Glu Asn Thr Gly Thr He Glu 
225 230 235 

CTC GAC CTT ATC TCT TGG CTC TGT TTC TCA GTA CTT TGAAGTTATA ' "6 4 
Leu Asp Leu He Ser Trp Leu Cys Phe Ser Val Leu 
240 245 250 

ACTAATCTGC CTGAAGACTT CTCATGATGG AAAATCAGCC AAGGACTAAG CTTCCATAGA 5 24 

AATACACTTT GTATCTGGAC CTCAAAATTA TGGGAACATT TACTTAAACG GATGATCATA 5 54 

GCTGAAAATA ATGATACTGT CAATTTGAGA TAGCAGAAGT TTCACACATC AAAGTAAAAG 544 

ATTTGCATAT CATTATACTA AATGCAAATG AGTCGCTTAA CCCTTGACAA GGTCAAAGAA i::4 

AGCTTTAAAT CTGTAAATAG TATACACTTT TTACTTTTAC ACACTTTCCT GTTCATAGC-. 1CS4 

ATATTAAATC AGGAAAAAAA AATGCAGGGA GGTATTTAAC AGCTGAGCAA AAACATTGAG 1124 

TCGCTCTCAA AGGACACGAG GCCCTTGGCA GGGAATATTT AAAGCAACT':' CAAGTTTAAA 1154 

ATGCAGCTGT TGATTCTACC AAACAACAGT CCAAGATTAC CATTTCCCAT GAGCCAACTC- 12 44 
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GGAAACATGG TATATCATGA AGTAATCTTG TCAAGGCATC TGGAGAGTCC AGGAGAGAAG 13 04 

ACTCACCTCT GTCGCTTGGG TTAAACAAGA GACAGGTTTT GTAGAATATT GATTGGTAAT 1364 

AGTAAATCGT TCTCCTTACA ATCAAGTTCT TGACCCTATT CGGCCTTATA CATCTGGTCT 1424 

TACAAAGACC AAAGGGATCC TGCGCTTGAT CAACTGAACC AGTATGCCAA AACCAGGCAT 1484 

CCAATTTGTA AACCAATTAT GATAAAGGAC AAAATAAGCT GTTTGCCACC TCAAAACTTT 1544 

ATGAACTTCA CCACCACTAG TGTCTGTCCA TGGAGTTAGA GGGGACATCA CTTAGAAGTT 1604 

CTTATAGAAA GGACACAAGT TTGTTTCCTG GCTTTACCTT GGGAAAATGC TAG C AAC ATT 1664 

ATAGAAATTT TGCCTTGTTG CCTTATCTTC TTCCAAATGT ACTGTTAAAT AAAAATAAAG 1724 

GGTTACCCCA TGCAATCAAA AAAAAAAAAA AAAAA 1759 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 Si amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Lys Asn Phe Phe Leu Thr Asn Arg Ala Arg Glu Arg Ser Asp Thr Phe 
15 10 is 

lie Asn Leu Arg Glu Val Leu Asn Arg Phe Lys Leu Pro Pro Gly Glu 
20 25 30 

Tyr lie Leu Val Pro Ser Thr Phe Glu Pro Asn Lys Asp Gly Asp Phe 
35 40 45 

Cys 'Sle^Arg Val Phe Ser Glu Lys Lys Ala Asp Tyr Gin Ala Val Aso 
50 55 60 

Asp Glu lie Glu Ala Asn Leu Glu Glu Phe Asp lie Ser Glu Asa Aso 
65 70 75 * sc 

lie Asp Asp Gly Phe Arg Arg Leu Phe Ala Gin Leu Ala Gly Glu Asp 
85 90 95 

Ala Glu lie Ser Ala Phe Glu Leu Gin Thr lie Leu Arg Arg Val Leu 
100 105 110 

Ala Lys Arg Gin Asp He Lys Ser Asp Gly Phe Ser He Glu Thr Cys 
115 120 125 

Lys He Met Val Asp Met Leu Asp Ser Asp Gly Ser Gly Lys Leu Gly 
130 135 140 

Leu Lys Glu Phe Tyr He Leu Trp Thr Lys He Gin Lys Tyr Gin Lys 
145 150 155 160 

He Tyr Arg Glu He Asp Val Asp Arg Ser Gly Thr Met Asn Ser Tyr 
165 170 175 

Glu Met Arg Lys Ala Leu Glu Glu Ala Gly Phe Lys Met Pro Cys Gin 
180 185 ISO 

3 5 


Leu His Gin Val He Val Ala Arg Phe Ala Asp Asp Gin Leu lie He 
195 200 205 

Asp Phe Asp Asn Phe Val Arg Cys Leu Val Arg Leu Glu Thr Leu Phe 
210 215 220 

Lys He Phe Lys Gin Leu Asp Pro Glu Asn Thr Gly Thr lie Glu Leu 
225 230 235 240 

Asp Leu He Ser Trp Leu Cys Phe Ser Val Leu 
245 250 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 700 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : sxngle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE „: protein 
(iii) HYPOTHETICAL: NO 


(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 7 : 

Met Ala Gly He Ala Ala Lys Leu Ala Lys Asp Arg Glu Ala Ala Glu 
15 10 15 

Gly Leu Gly Ser His Glu Arg Ala He Lys Tyr Leu Asn Gin Asp Tyr 
20 25 30 

Glu Ala Leu Arg Asn Glu Cys Leu Glu Ala Gly Thr Leu Phe Gin Asp 
35 40 45 

Pro Ser Phe Pro Ala He Pro Ser Ala Leu Gly Phe Lys Glu Leu Gly 
50 55 60 

Pro Tyr Ser Ser Lys Thr Arg Gly Met Arg Trp Lys Arg Pro Thr Glu 
65 70 75 SC 

He Cys Ala Asp Pro Gin Phe He He Gly Gly Ala Thr Arg Thr Asp 
85 90 95 

He Cys Gin Gly Ala Leu Gly Asp Cys Trp Leu Leu Ala Ala He Ala 
100 105 110 

Ser Leu Thr Leu Asn Glu Glu He Leu Ala Arg Val Val Pro Leu Asr. 
115 120 125 

Gin Ser Phe Gin Glu Asn Tyr Ala Gly He Phe His Phe Gin Phe Trp 
130 135 140 

Gin Tyr Gly Glu Trp Val Glu Val Val Val Asp Asp Arg Leu Pro Thr 
145 150 155 ISO 

Lys Asp Gly Glu Leu Leu Phe Val His Ser Ala Glu Gly Ser Glu ?r.~ 
165 170 175 

Trp Ser Ala Leu Leu Glu Lys Ala Tyr Ala Lys He Asn Gly Cys Tyr 
180 185 153 

Glu Ala Leu Ser Gly Gly Ala Thr Thr Glu Gly Phe Glu Asp Phe Thr 
195 200 205 
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Gly Gly 
210 

Phe Lys 
225 

Ser He 


Lys Leu 

Glu Ser 

Gly GLu 
290 

Asn Thr' 
305 

Asp Gly 

Arg Leu 

Tyr Lys 

Ser Thr 
370 

Pro Gin 
385 

Gly Glu 

■*A.rg„.Arg 

He Tyr 

Ser Lys 
450 

Phe He 
465 

Glu Tyr 
Phe Cys 
Asp Asp 


Asp He 
530 

Asp Ala 
545 


He Ala Glu 

He He Gin 

Asp He Thr 
245 

Val Lys Gly 
260 

Asn Gly Ser 
275 

Val Glu Trp 

He Asp Pro 

Glu Phe T-rp 
325 

Glu He Cys 
340 

Lys Trp Lys 
355 

Ala Gly Gly 
Tyr Leu He 


Ser Gly Cys 
405 

Gin Arg Lys 
420 

Glu Val Pro 
435 

Asn Phe Phe 


Asn Leu Arg 

He Leu Val 
485 

He Arg Val 
500 

Glu He Glu 
515 

Asp Asp Gly 
Glu He Ser 


Trp Tyr Glu Leu Lys Lys Pro Pro 
215 220 

Lys Ala Leu Gin Lys Gly Ser Leu 
230 235 


Ser Ala Ala Asp 
His Ala 


Leu Gin 


Tyr Ser 
265 

Lys Leu 
280 


Thr Gly 
295 


Arg Trp 
Arg Glu 
Met Ser Phe Ser 


Glu Glu 
310 


Asn Leu 


Leu Thr 


Cys Arg 

375 

Lys Leu 
390 


Thr Pro 
345 

Lys Met 
360 

Asn Tyr 

Glu Glu 


Thr Phe Leu Val 


Met Gly 
Glu Glu 


Leu Thr 
455 

Glu Val 
470 


Glu Asp 
425 

Leu Ser 
440 


Ser Glu Ala He 
250 

Val Thr Gly Ala 


He Arg He Arg 
285 

Asn Asp Asn Cys 
300 

Arg Leu Thr Arg 
315 

Asp Phe Leu Arg 
330 

Asp Thr Leu Thr 


Asp Gly Asn Trp 
365 

Pro Asn Thr Phe 
380 

Glu Asp Glu Asn 
395 

Gly Leu He Gin 
410 

Met His Thr He 


Gly Gin Thr Asr. 

445 


Asn Arg Ala Arg Glu Arg 
460 

Leu Asn Arg Phe Lys Leu 
47 5 


Pro Ser Thr Phe 


Phe Ser 


Ala Asn 


Val Arg 
535 

Ala Phe 
550 


Glu Lys 
505 

Leu Glu 
520 

Arg Leu 
Glu Leu 


Glu Pro Asn Lvs 
490 

Lys Ala Asp Tyr 


Glu Phe Asp He 

535 

Phe Ala Gin Leu 
540 

Gin Thr He Leu 
555 


Pro Asn Leu 

Leu Gly Cys 
240 

Thr Phe Gin 
255 

Glu Glu Val 
270 

Asn Pro Trp 

Pro Ser Trp 

Arg His Glu 
320 

His Tyr Ser 
335 

Ser Asp Thr 
350 

Arg Arg Gly 

Trp Met Asn 

Glu Glu Asp 
400 

Lys His Arg 
415 

Gly Phe Glv 
4 30 

He His Le_ 
Ser Asd Thr 


Pro Pro Gly 
480 

Asp Gly Asp 
495 

Gin Ala Val 
510 

Ser Glu Asd 


■.la Giv Glu 


Arg Arg Val 
5 6 C 
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Leu Ala Lys Arg Gin Asp 
565 

Cys Lys lie Met Val Asp 
580 

Gly Leu Lys Glu Phe Tyr 
595 

Lys lie Tyr Arg Glu He 
610 

Tyr Glu Met Arg Lys Ala 
625 630 

Gin Leu His Gin Val lie 
645 

He Asp Phe Asp Asn Phe 
660 

Phe Lys He Phe Lys Gin 
675 

Leu Asp Leu He Ser Trp 
690 


He Lys Ser Asp Gly 
570 

Met Leu Asp Ser Asp 
58 5 

He Leu Trp Thr Lys 
600 

Asp Val Asp Arg Ser 
615 

Leu Glu Glu Ala Gly 
635 

Val Ala Arg Phe Ala 
650 

Val Arg Cys Leu Val 
665 

Leu Asp Pro Glu Asn 
680 

Leu Cys Phe Ser Val 
695 


Phe Ser He Glu Thr 
575 

Gly Ser Gly Lys Leu 
590 

He Gin Lys Tyr Gin 

605 

Gly Thr Met Asn Ser 

620 

Phe Lys Met Pro Cys 
640 

Asp Asp Gin Leu He 
655 

Arg Leu Glu Thr Leu 
670 

Thr Gly Thr He Glu 
685 

Leu 
700 


33 


v 


